Display apparatus and method for controlling the display apparatus

ABSTRACT

An electronic apparatus is provided, which includes an output, a voice collector configured to collect a user voice, and a controller configured to control the output to output a system response corresponding to the user voice, in which the controller is further configured to control the output such that a voice command guide applicable to a current situation of the electronic apparatus is outputted.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This application claims priority from Korean Patent Application No.10-2013-0001803, filed on Jan. 7, 2013, in the Korean IntellectualProperty Office, the disclosure of which is incorporated herein byreference in its entirety.

BACKGROUND

1. Field

Apparatuses and methods consistent with exemplary embodiments relate todisplaying, and more particularly, to a display apparatus configured tobe controlled according to user's voice, and a control method thereof.

2. Description of the Related Art

Different types of display apparatus are developed and supplied thesedays, thanks to advancements in electronic technology. As a result,users are provided with more varied functions. For example, televisions(TVs) are now able to connect to the Internet and provide internetservice, enabling users to view numerous broadcasts including digitalbroadcast channels.

In the related art, one of the recently-developed technologies employsvoice recognition to control a display apparatus with increasedconvenience and intuition. TVs are now able to recognize a user's voiceand perform a function corresponding to the user's voice such as volumeadjustment or channel change.

However, while the display apparatuses of the related art are capable ofperceiving user's voice to provide functions that correspond to theperceived voice, they do not provide the users with interactiveinformation. The display apparatuses also have limited use, due to thefact that the users do not know which words to use to perform voicecontrol of the display apparatus, when they first use the displayapparatuses.

SUMMARY

Exemplary embodiments address at least the above problems and/ordisadvantages and other disadvantages not described above. Also,exemplary embodiments are not required to overcome the disadvantagesdescribed above, and an exemplary embodiment may not overcome any of theproblems described above.

One or more exemplary embodiments provide a display apparatus configuredto provide voice command guide applicable to current situation of thedisplay apparatus, and a control method thereof.

According to an aspect of an exemplary embodiment, there is provided anelectronic apparatus, including: an output; a voice collector configuredto collect a user voice; and a controller configured to control theoutput to output a system response corresponding to the user voice. Thecontroller may be further configured to control the output such that avoice command guide applicable to a current situation of the electronicapparatus is outputted.

In response to the system response corresponding to the user voice beingoutputted, the controller may be further configured to control theoutput to output the voice command guide including a guide which isrelated with the system response.

In response to a list of contents, resulting from a search performed inaccordance with the user voice including a command to search forcontents, being outputted, the controller may be further configured tocontrol the output to output the voice command guide including a guideto filter the content included in the list of contents.

In response to a list of contents, resulting from a search performed inaccordance with the user voice including a command to search forcontents, being outputted, the controller may be further configured tocontrol the output to output the at least one of voice command guide toexecute specific content included in the content list, and voice commandguide to output details of the specific content.

In response to an application executing screen, corresponding to theuser voice including a command to execute the corresponding application,being outputted, the controller may be further configured to control theoutput to output the voice command guide including a guide applicable tothe application executing screen. The outputting the applicationexecuting screen may include displaying the application executing screenon a screen included in the electronic apparatus.

The electronic apparatus may be a display apparatus.

The electronic apparatus may additionally include a first communicatorconfigured to transmit the user voice to a first server and receive fromthe first server text information corresponding to the user voice, and asecond communicator configured to transmit the received text informationto a second server. In response to response information corresponding tothe text information being received from the second server, thecontroller may be further configured to control the output such that thesystem response is outputted based on the response information.

According to an aspect of another exemplary embodiment, there isprovided a control method of an electronic apparatus, including:collecting a user voice; transmitting the user voice to a first serverand receiving from the first server text information corresponding tothe user voice, transmitting the received text information to a secondserver; and in response to response information corresponding to thetext information being received from the second server, outputting asystem response corresponding to the user voice based on the responseinformation and outputting a voice command guide applicable to a currentsituation of the electronic apparatus.

In response to a list of contents, resulting from a search performed inaccordance with the user voice including a command to search forcontents, being outputted, outputting the voice command guide includinga guide to filter the content included in the list of contents.

In response to a list of contents, resulting from a search performed inaccordance with the user voice including a command to search forcontents, being outputted, outputting the voice command guide includingat least one of voice a guide to execute specific content included inthe content list, and a guide to output details of the specific content.

In response to an application executing screen, corresponding to theuser voice including a command to execute the corresponding application,being outputted, outputting the voice command guide including a guideapplicable to the application executing screen.

The electronic apparatus may be a display apparatus.

According to an aspect of another exemplary embodiment, there isprovided a method of generating a voice command guide for an electronicdevice, the method including: receiving a user voice for controlling theelectronic device; outputting a system response corresponding to theuser voice; generating the voice command guide corresponding to thesystem response; and outputting the voice command guide.

The voice command guide may include information for controlling theelectronic device including a voice command for controlling a functionthe electronic device.

The method may further include: searching for contents in response tothe user voice including a command to search for contents; outputting alist of contents, resulting from the search for contents, as the systemresponse; and outputting, in response to the outputting the list ofcontents, at least one of a guide for filtering the list of contents, aguide for executing a specific content included in the list of contents,or a guide for outputting details of a specific content included in thelist of contents, as the voice command guide.

The method may further include: executing an application in response tothe user voice including a command to execute the application;outputting an application executing screen, corresponding to theapplication, as the system response; and outputting, in response to theoutputting the application executing screen, a guide for executing afunction on the application executing screen as the voice command guide,in response to outputting the list of contents. The outputting theapplication executing screen may include displaying the applicationexecuting screen on a screen included in the electronic device.

The electronic device may include a screen. Outputting the systemresponse and the outputting the voice command guide may includedisplaying the system response and the voice command guide on thescreen.

The electronic device may include an audio output. The outputting thesystem response and the outputting the voice command guide may includeoutputting the system response and the voice command guide to the audiooutput as an audio output signal.

According to an aspect of another exemplary embodiment, there is providean electronic device including: an output; an input configured toreceive a voice of a user for controlling the electronic device; and acontroller configured to control the output to output a system responsecorresponding to the received user voice and to output a voice commandguide corresponding to the system response.

The voice command guide may include information for controlling theelectronic device including a voice command for controlling a functionthe electronic device.

The controller may be further configured to: search for contents inresponse to the user voice including a command to search for contents,control the output to output a list of contents, resulting from thesearch for contents, as the system response, and control the output tooutput, in response to the output outputting the list of contents, atleast one of a guide for filtering the list of contents, a guide forexecuting a specific content included in the list of contents, or aguide for outputting details of a specific content included in the listof contents, as the voice command guide.

The electronic device may further include a display. The controller maybe further configured to: execute an application in response to the uservoice including a command to execute the application, control the outputto output an application executing screen, corresponding to theapplication, as the system response, and control the output to output,in response to the output outputting the application executing screen, aguide for executing a function on the application executing screen asthe voice command guide, in response to outputting the list of contents.The output outputting the application executing screen and the guide forexecuting the function on the application executing screen may includeoutputting the application executing screen and the guide for executingfunction to the display to be displayed.

The electronic device may further include a display. The outputoutputting the system response and the outputting the voice commandguide may include outputting the system response and the voice commandguide to the display to be displayed.

The electronic device may further include an audio output. The outputoutputting the system response and the outputting the voice commandguide may include outputting the system response and the voice commandguide to the audio output to be output as audio.

According to various exemplary embodiments, because the voice commandguide including usable voice information is provided, even novice usersof the interactive system are able to conveniently use the system.Further, because a voice command guide related with the system response,outputted in accordance with the user's utterance, is provided, userscan naturally learn the voices that are applicable to the situation.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and/or other aspects will become more apparent by describingcertain exemplary embodiments with reference to the accompanyingdrawings, in which:

FIG. 1 is a view provided to explain an interactive system according toan exemplary embodiment;

FIG. 2 is a schematic block diagram of a display apparatus according toan exemplary embodiment;

FIG. 3 is a detailed block diagram of the display apparatus of FIG. 2;

FIG. 4 is a block diagram of the first server of FIG. 1;

FIG. 5 is a block diagram of the second server of FIG. 1;

FIGS. 6 to 11 are views provided to explain a method for providing voicecommand guide according to an exemplary embodiment; and

FIG. 12 is a flowchart provided to explain a control method of a displayapparatus according to an exemplary embodiment.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

Certain exemplary embodiments will now be described in greater detailwith reference to the accompanying drawings.

In the following description, same drawing reference numerals are usedfor the same elements, even in different drawings. The matters definedin the description, such as detailed construction and elements, areprovided to assist in a comprehensive understanding of the exemplaryembodiments. Accordingly, it is apparent that the exemplary embodimentscan be carried out without those specifically defined matters. Also,well-known functions or constructions are not described in detail sincethey would obscure the exemplary embodiments with unnecessary detail.

FIG. 1 is a view provided to explain an interactive system according toan exemplary embodiment.

Referring to FIG. 1, the interactive system 1000 includes a displayapparatus 100, a first server 200, and a second server 300.

The display apparatus 100 may be controlled by a remote controller (notillustrated) provided to control the display apparatus 100. That is, thedisplay apparatus 100 may perform an operation that corresponds to areceived remote control signal from the remote controller (notillustrated). For example, when the display apparatus 100 is implementedas a TV, as illustrated in FIG. 1, the display apparatus 100 may performan operation such as power on/off, channel change, or volume change, inaccordance with a received remote control signal.

Further, the display apparatus 100 may perform various operationsaccording to the user's voice.

The display apparatus 100 transmits a collected user voice to the firstserver 200. The first server 200, on receiving the user voice from thedisplay apparatus 100, converts the received user voice into textinformation (or text) and transmits the text information to the displayapparatus 100.

The display apparatus 100 transmits the text information received fromthe first server 200 to the second server 300. The second server 300,when receiving text information from the display apparatus 100,generates response information corresponding to the received textinformation and transmits it to the display apparatus 100.

Meanwhile, the display apparatus 100 may perform various operationsbased on the response information received from the second server 300.

For example, the display apparatus 100 may perform functionscorresponding to the user voice. That is, among the functions providedby the display apparatus 100, the display apparatus 100 may perform afunction that corresponds to the user voice. For example, in response toa user voice “turn to channel #O (i.e., channel number),” the displayapparatus 100 may tune to channel #O and output the same in accordancewith a control command received from the second server 300. To thispurpose, the second server 300 may transmit a control command to thedisplay apparatus 100 to perform a function corresponding to the uservoice.

In the above example, the display apparatus 100 may also provide asystem response corresponding to the corresponding function. That is,the display apparatus 100 may output information about the functionperformed in response to the user voice in the form of at least one of avoice and a user interface (UI) screen (text, image, etc.) In the aboveexample, the display apparatus 100 may output a message in at least oneof voice and UI screen forms, to indicate the information about thechanged channel or to inform a user that the channel change iscompleted.

Further, the display apparatus 100 may output a system response inresponse to the user voice. The system response may be outputted in theform of at least one of a voice and a UI screen. For example, when auser voice, inquiring about a broadcast time of a specific program, isimputed, the display apparatus 100 may output a correspondingbroadcasting time of the program in the form of at least one of a voiceand a UI screen.

Further, the display apparatus 100 may output a voice command guide thatis implementable in the current situation. That is, when the systemresponse is outputted in response to the user voice, the displayapparatus 100 may output the voice command guide related to the systemresponse.

For example, it is assumed that the display apparatus 100 outputs a listof broadcast programs to be broadcast today, as a system response to theuser voice “What is on TV today?” In this example, the display apparatus100 may output voice command guide including information about uservoice that can be used with respect to the list of broadcast programsoutputted as the system response, which may include “What's on XXX(i.e., channel name)?”, “What is featuring XXX (i.e., appearing person'sname)?”, “Can I see XXX (i.e., program name)?”, “The third one,” or “CanI see details of the third one?”

Accordingly, when the user voice included in the voice command guide iscollected, the display apparatus 100 may perform a correspondingoperation. In the example explained above, the display apparatus 100 mayoutput a list of broadcast programs provided by the XXX (i.e., channelname) among the broadcast programs due to be broadcasted today, when thecollected user voice is “What is on XXX (i.e., channel name)?” Further,when the collected user voice is “The third one,” the display apparatus100 may tune to a third broadcast program on the list outputted as thesystem response and output the same.

As explained above, because the display apparatus 100 provides the voicecommand guide, even users of the interactive system that have not usedthe system before can conveniently use the system. Further, because thedisplay apparatus 100 provides the voice command guide that is relatedto the system response outputted in accordance with the user'sutterance, the users can naturally learn the voices that are applicableafter they make utterances.

Meanwhile, although the display apparatus 100 is implemented as a TV inFIG. 1, other embodiments are possible. For example, the displayapparatus 100 may be implemented as various electronic apparatuses suchas mobile phones such as smart phones, desktop personal computers (PCs),laptop PCs, or global positioning systems (GPSs).

Further, although the first and second servers 200, 300 are implementedas separate servers in FIG. 1, other embodiments are possible. For oneexample, the first and second servers 200, 300 may be implemented as oneinteractive server.

FIG. 2 is a schematic block diagram of a display apparatus according toan exemplary embodiment.

Referring to FIG. 2, the display apparatus 100 includes an output 110, avoice collector 120, and a controller 130.

The output 110 outputs at least one of a voice and an image. Forexample, the output 110 may output a system response, corresponding tothe user voice collected at the voice collector 120, in the form of atleast one of a voice and a graphic UI (GUI).

To that purpose, the output 110 may include a display (not illustrated)and an audio output (not illustrated).

The display (not illustrated) may be implemented as a liquid crystaldisplay (LCD), an organic light emitting display (OLED), or plasmadisplay panel (PDP). The display (not illustrated) may provide variousdisplay screens that can be provided through the display apparatus 100.For example, the display (not illustrated) may construct a UI screen,based on the system response to the user voice, including at least oneof text, an image, an icon, and a GUI, and display the same.

Meanwhile, the audio output (not illustrated) may be implemented as anoutput port, such as a jack or a speaker, to output system response tothe user voice in the voice form.

Further, the output 110 may output various contents. The contents mayinclude broadcast contents, video on demand (VOD), premium VOD, digitalversatile disc (DVD) content, etc. For example, the display (notillustrated) may output an image that constitutes the contents, and theaudio output (not illustrated) may output an audio that constitutes thecontents.

The voice collector 120 may collect the user voices. For example, thevoice collector 120 may be implemented as a microphone to collect theuser voices, either inside the display apparatus 100 as an embeddedpart, or separate from the display apparatus 100. When implemented asseparate from the display apparatus 100, the voice collector 120 may beimplemented in a form that can be easily grabbed by the user or that canbe laid on a table, etc. The voice collector 120 may be connected to thedisplay apparatus 100 by wires or a wireless network, to transmit thecollected user voice to the display apparatus 100.

Meanwhile, the voice collector 120 may determine whether the collectedvoice belongs to the user or not, and filter noise (e.g., noise from airconditioner, vacuum cleaner, or music) from the collected user voice.

For example, when an analog form of the user voice is inputted, thevoice collector 120 may sample the same and convert into it into adigital signal. The voice collector 120 may calculate the energy of theconverted digital signal to determine whether the energy of the digitalsignal is equal to or greater than a preset value or not.

When the energy of the digital signal is equal to or greater than thepreset value, the voice collector 120 may remove a noise component fromthe digital signal and transmit the resultant signal to the firstcommunicator 130. The noise component, which may be sporadic in nature,may be generated from the domestic environment and may include, forexample, an air conditioner sound, a vacuum cleaner sound, or music.Meanwhile, when the energy of the digital signal is less than the presetvalue, the voice collector 120 may wait for another input, instead ofprocessing the digital signal.

Accordingly, since the entire audio processing is not activated bysounds other than a user voice, unnecessary power consumption can beprevented.

As shown in FIG. 1, the first communicator 130 may perform communicationwith the first server 200. For example, the first communicator 130 maytransmit a user voice to the first server 200, and receive from thefirst server 200 the text information corresponding to the user voice.

As shown in FIG. 2, the second communicator 140 performs communicationwith the second server 300. For example, the second communicator 140 maytransmit the received text information to the second server 300 andreceive from the second server 300 the response information thatcorresponds to the text information.

Accordingly, the first and second communicators 130, 140 may performcommunication with the first and second servers 200, 300 using variouscommunication methods.

For example, the first and second communicators 130, 140 may performcommunication with the first and second server 200, 300 usingwired/wireless local area network (LAN), WAN, Ethernet, Bluetooth,Zigbee, universal serial bus (USB), IEEE 1394, or WiFi. To that purpose,the first and second communicators 130, 140 may employ chips or inputports that correspond to the communication method. For example, whencommunicating in a wired LAN manner, the first and second communicators130, 140 may include a wired LAN card (not illustrated) and an inputport (not illustrated).

In the exemplary embodiments explained above, to communicate with thefirst and second servers 200, 300, the display apparatus 100 may includeseparate communicators 130, 140. However, other embodiments arepossible. For example, the display apparatus 100 may communicate withthe first and second servers 200, 300 via one communication module.

Further, the first and second communicators 130, 140 may communicatewith the first and second servers 200, 300, but are not limited thereto.For example, the first and second communicators 130, 140 may beconnected to a web server (not illustrated) to perform web browsing.

The controller 150 may control the overall operation of the displayapparatus 100. That is, the controller 150 may control the operations ofthe output 110, the voice collector 120, and the first communicator andthe second communicator 130, 140. The controller 150 may include amodule to control a central processing unit (CPU) and the displayapparatus 100, and read only memory (ROM) and random access memory (RAM)to store data.

For example, the controller 150 may collect user voices through thevoice collector 120 and control the first communicator 130 to transmitthe collected user voices to the first server 200. The controller 150,when receiving text information corresponding to the user voice, maycontrol the second communicator 140 to transmit the received textinformation to the second server 300.

Meanwhile, the controller 150 may control the output 110 to output asystem response that corresponds to the user voice. For example, whenreceiving response information corresponding to the text information,the controller 150 may control the output 110 to output a systemresponse corresponding to the user voice based on the responseinformation.

The response information may include system response information tooutput the system response.

In one exemplary embodiment, the system response information may be in atext form that represents the system response outputted from the displayapparatus 100 in response to the user voice, and the controller 150 maycontrol the output 110 such that the system response corresponding tothe user voice is outputted as at least one of a voice and a UI screenin response to the user voice, based on the system response information.

More specifically, using a text to speech (TTS) engine, the controller150 may convert the system response information in text form into avoice and output the same through the output 110. The TTS engine is amodule configured to convert text into speech, and it is thus possibleto convert text into voice using various known TTS algorithms. Further,the controller 150 may control the display apparatus 100 such that a UIscreen, including the text constituting system response information, isconstructed and outputted via the output 110.

For example, when the display apparatus 100 implemented as TV collectsthe user voice “Let me know the most popular program,” the second server300 expresses in the text form “The most popular program is XXX (i.e.,program name)” and transmit the same to the display apparatus 100. Inthis example, the controller 150 may convert the text form of “The mostpopular program is XXX (i.e., program name)” into a voice and output thesame via the output 110, or construct a UI screen that includes the text“The most popular program is XXX (i.e., program name)” and output thesame via the output 110. Further, the controller 150 may output acombination of a voice and a GUI.

In the exemplary embodiments explained above, the system responseinformation transmitted from the second server 300 may be in the textform expressing the system response outputted from the display apparatus100. However, other embodiments are possible. For example, the systemresponse information may include voice data that constructs the systemresponse outputted from the display apparatus 100, or part of the voicedata that constructs the system response. Alternatively, the systemresponse information may be in a control signal form to output thesystem response using voice or text previously stored in the displayapparatus 100.

Accordingly, the controller 150 may output the system response byconsidering the form of the system response information.

For example, when the voice data that constructs the system response isreceived, the controller 150 may process the corresponding data into aform outputtable via the output 110 and output the same in a voice form.

On the other hand, i.e., when the control signal to output the systemresponse is received, the controller 150 may search pre-stored data ofthe display apparatus 100 for the data that matches the control signalpre-stored, and process the voice or text data that is found into anoutputtable form so that the same is outputted via the output 110. Tothat purpose, the display apparatus 100 may store in advance the voiceor text data to provide the system response. For example, the displayapparatus 100 may store the data in a complete sentence form such as“The channel change is completed,” or may store the data in a partialsentence form such as “The channel is changed to . . . ” In the lattercase, the channel name that can complete the sentence may be receivedfrom the second server 300.

In another exemplary embodiment, the system response information mayinclude various information so as to output the system response from thedisplay apparatus 100 in response to the user voice.

For example, the system response information may include informationabout the content that is searched according to an intention of theuser's utterance. That is, the second server 300 may search the contentsthat match the intention of the utterance included in the user voice,and transmit to the display apparatus 100 information about the searchedcontent (e.g., at least one of content name, genre, broadcast time,cast, thumbnail image corresponding to the content, channel name thatbroadcasts the content, channel number, and staff).

For example, when the display apparatus 100 implemented as TV collectsthe user voice “What's on TV today?”, the second server 300 may searchfor the broadcast program due to be broadcast today based on anelectronic program guide (EPG), and transmit the information about thebroadcast program to the display apparatus 100.

Accordingly, the controller 150 may generate a content list based on theinformation received from the second server 300 and output the generatedcontent list. In one exemplary embodiment, the controller 150 may outputthe content list in the form of at least one of a voice and a UI screen.

For example, as a system response to the user voice, the controller 150may construct a UI screen that includes a list of programs scheduled tobe broadcasted today, and output the same. The controller 150 maygenerate a content list in which the searched content is listed in theorder of broadcasting time.

The controller 150 may also add various information related to thesearched content to the search list. For example, the controller 150 mayadd to the search list at least one of the name of the searchedbroadcast program, broadcasting time, channel name that provides thebroadcast program, channel number, and production company, based on theinformation about the broadcast program received from the second server300.

While the second server 300 may transmit the information about thecontent to the display apparatus 100 and the display apparatus 100 maygenerate a content list using the received information and output thisas a system response, this is only an example.

Accordingly, other embodiments are possible. For example, the secondserver 300 may transmit information about a determined intention of theutterance by the user to the display apparatus 100 and the displayapparatus 100 may output the system response corresponding to the uservoice based on the information about the determined intention of theutterance included in the user voice.

For example, when the second server 300 determines the intention of theutterance included in the user voice to be a request for informationabout the broadcast programs scheduled to be broadcasted today, thesecond server 300 may transmit to the display apparatus 100 a controlcommand to output a list of the broadcast programs scheduled to bebroadcasted today. Accordingly, the controller 150 may search for thebroadcast programs scheduled to be broadcasted today based on the EPG,and output the list of the broadcast programs as a system response.

Meanwhile, the response information may include a control command tocontrol the function of the display apparatus 100. The control commandmay include a command to execute a function that corresponds to the uservoice, from among the functions executable on the display apparatus 100.Accordingly, based on the control command received from the secondserver 300, the controller 150 may control the respective components ofthe display apparatus 100 to execute the function corresponding to theuser voice.

For example, when the display apparatus 100 collects the user voice“turn up the volume,” the second server 300 may transmit a controlcommand, to increase the volume of the display apparatus 100, to thedisplay apparatus 100. In one exemplary embodiment, the controller 150may increase the volume of the audio outputted via the output 110 basedon the control command. Further, when the display apparatus 100 collectsthe user voice “Execute web browsing, please,” the second server 300 maytransmit a control command, to execute the application related to theweb browsing, to the display apparatus 100. In one exemplary embodiment,the controller 150 may execute the application for web browsing amongthe pre-stored applications, based on the control command.

However, other embodiments are also possible. For example, thecontroller 150 may control the respective components of the displayapparatus 100 to perform various operations including power on/off,channel change, etc. in response to the collected user voice.

Further, the response information may include system responseinformation related to the function that is executed in response to thecontrol command. In one exemplary embodiment, the controller 150 mayperform a function according to a control command, and control thedisplay apparatus 100 such that the system response related to theexecuted function is outputted in a form of at least one of a voice anda UI screen, based on the system response information.

For example, when the display apparatus 100 implemented as TV collectsthe user voice “Turn to channel #11,” the second server 300 may transmitto the display apparatus 100 a control command to change the channel ofthe display apparatus 100 to channel #11, along with a text form thatexpresses “Channel is changed to #11.” In one exemplary embodiment, thecontroller 150 may change the broadcast channel outputted via the output110 to channel #11 based on the control command. The controller 150 maythen convert the text form that expresses “Channel is changed to #11” toa voice saying “Channel is changed to #11” and output the same via theoutput 110, or construct a UI screen that includes a text “Channel ischanged to #11” and output the same via the output 110.

As explained above, the controller 150 may perform various operationsaccording to the collected user voice. That is, the controller 150 mayoutput a system response that corresponds to the user voice or controlthe display apparatus 100 such that a function corresponding to the uservoice is executed.

Meanwhile, a system response corresponding to the user voice may beoutputted without a separate function of the display apparatus 100 beingexecuted, when the user voice directs to the display apparatus 100 toexecute a function that cannot be executed on the display apparatus 100.

For example, for purposes of explanation it is assumed that the displayapparatus 100 is implemented as a TV that does not support telephonecall function. In this case, when the display apparatus 100 collectsuser voice “Call to XXX (i.e., another user), please,” the second server300 may transmit a control command to perform telephone call function tothe display apparatus 100. However, because the display apparatus 100does not support a function that corresponds to the control command, thecontroller 150 is not able to perceive the control command received fromthe second server 300. In this case, the controller 150 may output asystem response “The function is not supported” via the output 110 inthe form of at least one of a voice and a GUI.

Meanwhile, the controller 150 may control the display apparatus 100 suchthat a voice command guide, which can be used in the current situationof the display apparatus 100, is outputted.

The “current situation” may include a situation where a specificfunction is being executed in the display apparatus 100, such as, theinitial screen being outputted, content being outputted, or anapplication being executed. Further, the initial screen may include ascreen that is first provided in an interactive mode in which thedisplay apparatus 100 is controlled by the user voice. For example, whena specific key provided on a manipulation panel of the display apparatus100 is selected, or when a specific remote control signal is receivedfrom a remote controller (not illustrated), the controller 150 mayoperate in the interactive mode to display the initial screen andcollect the voices uttered by the user.

More specifically, the controller 150 may output a voice command guidethat represents the user voices that can execute a specific operation ina situation where a specific function is executed such as initial screenoutput, content output, or application executing.

For example, in a situation where the initial screen is being outputted,the voice command guide, including the user voice that can execute anoperation executable on the display apparatus 100, such as, for example,“What is on TV today?”, “Anything fun?”, “Any new movies?”, “Recommendpopular one,” “Turn to XXX (i.e., channel name),” “Execute web browser,please,” may be outputted.

Further, in a situation where the content is being outputted, the voicecommand guide, including the user voice that is applicable to thecontent, may be outputted. For example, the voice command guide,including a user voice that can search the content or control (i.e.,change the content or change the volume) the content, for example, “Whatis on TV today?”, “Anything fun?”, “Any new movies?”, “Recommend popularone,” “Turn to XXX (i.e., channel name),” or “Turn up the volume,” maybe outputted in a situation where the content is being outputted.

Further, in a situation where the application is being executed, thevoice command guide including the user voice that is applicable to anapplication executing screen, may be outputted. For example, the voicecommand guide, including a user voice that is applicable for executing aspecific function on a web page screen, such as, for example, “homepage,” “favorites,” “refresh,” “open new page,” “close current page,”“backward,” “forward,” or “end,” may be outputted in a situation wherethe application for web browsing is being executed and a web page screenis subsequently being displayed.

When the system response corresponding to the user voice is outputted,the controller 150 may output a voice command guide related to thesystem response. That is, the controller 150 may output a systemresponse that matches the intention of the user utterance included inthe user voice, and output a voice command guide that represents theuser voice that can execute a specific operation in response to theoutputted system response.

More specifically, when the content list, searched for in accordancewith the user voice to search content, is outputted, the controller 150may output a voice command guide to filter the contents from the contentlist that includes the contents. That is, when the user voice withutterance intention to search content is received, the controller 150may output a list of contents, searched for according to the user voice,as a system response, and output, as a voice command guide, theinformation about the user voice that can be used to re-search aspecific content among the contents on the list.

For example, when the user voice “What is on TV today?” is collected,the controller 150 may output a system response based on the list ofbroadcast programs scheduled to be broadcasted today. In one exemplaryembodiment, the controller 150 may output a voice command guide based onthe information about the user voice that can be used for selecting aspecific content from the list of the broadcast programs scheduled to bebroadcasted today, such as, for example, “What is on XXX (i.e., channelname)?”, “Can I see documentary program?”, “Can I see program thatfeatures XXX (i.e., appearing person's name)?”, “Find XXX (i.e., programname)”.

Further, when the list of the contents, searched for in response to theuser voice to search content, is outputted, the controller 150 mayoutput at least one of voice command guide to execute a specific contentincluded in the content list, and voice command guide to output detailsof the specific content. The “details” may include at least one of aname of the content, broadcasting time, cast, theme, channel number thatprovides the content, and channel name.

That is, when the user voice with the utterance intention to searchcontent is received, the controller 150 may output a list of contents,searched for in response to the user voice, as a system response, andoutput a voice command guide about information on the user voices whichcan be used for executing a specific content among the contents on thelist, or for outputting details of a specific content.

For example, when the user voice “What is on TV today?” is collected,the controller 150 may output a system response based on a list ofbroadcast programs scheduled to be broadcasted today. In one exemplaryembodiment, the controller 150 may output voice command guide aboutinformation on the user voices which can be used for executing aspecific broadcast program on the list of the broadcast programsscheduled to be broadcasted today, or for outputting details of aspecific broadcast program such as, for example, “The third one,” or“Can I see the details of the third one?”

When a corresponding application executing screen is outputted inresponse to a user voice to execute an application, the controller 150may output a voice command guide which is applicable on an applicationexecuting screen. That is, when a user voice with an utterance intentionto execute an application is received, the controller 150 may execute anapplication corresponding to the user voice and output a voice commandguide based on the information about the user voice that can be used forexecuting a specific function in the corresponding application.

For example, when a user voice “Execute a web browser” is collected, thecontroller 150 may execute an application for web browsing and output aweb page screen. In one exemplary embodiment, the controller may outputvoice command guide based on the information about user voice which maybe used for executing a specific function on the web page screen, suchas, for example, “home page,” “favorites,” “refresh,” “open new page,”“close current page,” “backward,” “forward,” or “end.”

In one exemplary embodiment, the controller 150 may output voice commandguide at a preset period or at a time of executing a specific function.

For example, the controller 150 may output a voice command guide that isapplicable in the current situation of the display apparatus 100 at apreset period. Further, the controller 150 may output the voice commandguide at a time when the initial screen is outputted, a content isoutputted, an application is executed, or a system responsecorresponding to user voice is outputted. Further, the controller 150may output the voice command guide at a time when the initial screen isoutputted, a content is outputted, an application is executed, or asystem response is outputted corresponding to the user voice, andre-output the voice command guide at preset time period.

To that purpose, the display apparatus 100 may store information aboutuser voices that are applicable in the current situation. That is, thedisplay apparatus 100 may store information about the user voices thatcan be used in each situation. Accordingly, the controller 150 maydetermine information about the user voices that are applicable in thecurrent situation of the display apparatus 100 and output the voicecommand guide based on the information about the determined user voices.

For example, when a list of contents, searched for according to a uservoice to search contents, is outputted, the controller 150 may determinethat the user voices to filter content, execute specific content, oroutput details of specific contents are usable. Accordingly, among thepre-stored information, the controller 150 may detect, from thepre-stored information, the information about the user voices that canbe used in the situation where the content list is outputted, and outputthe detected information as the voice command guide.

Meanwhile, in certain exemplary embodiments, the display apparatus 100may store information about user voices that are applicable in thecurrent situation, although not limited thereto. Accordingly, in anotherexemplary embodiment, the second server 300 may transmit informationabout voice command guide that can be outputted in the current situationto the display apparatus 100 and the controller 150 may output the voicecommand guide using the received information.

For example, in response to the user voice “What is on TV today?” thesecond server 300 may transmit information about broadcast programsscheduled to be broadcasted today to the display apparatus 100. In thiscase, considering that the display apparatus 100 outputs a systemresponse based on the list of broadcast programs scheduled to bebroadcasted today using the received information, the second server 300may transmit to the display apparatus 100 the information about the uservoices that are applicable to the list of broadcast programs outputtedas the system response, such as, for example, “The third one,” “Can Isee the details of the third one?”, “What is on XXX (i.e., channelname)?”, “Can I see documentary program?”, “Can I see program thatfeatures XXX (i.e., appearing person's name)?”, “Find XXX (i.e., programname).” In one exemplary embodiment, the controller 150 may output thevoice command guide based on the information about the user voices whichcan be used on the list of broadcast programs using the receivedinformation.

FIG. 3 is a detailed block diagram of the display apparatus of FIG. 2.Referring to FIG. 3, the display apparatus 100 may additionally includea storage 160, a receiver 170, a signal processor 180, a remote controlsignal receiver 191, an input 193, and an interface 195, in addition tothe components illustrated in FIG. 2. The components may be controlledby the controller 150. Meanwhile, for the sake of brevity, theoverlapping components between FIGS. 2 and 3 will not be redundantlyexplained below.

The storage 160 may be implemented as a storage medium, such as, forexample, a memory, or a hard disk drive (HDD), that stores programs orthe like that are necessary for the operation of the display apparatus100. For example, the storage 160 may include a ROM to store programs toperform operation of the controller 150, and a RAM to temporarily storethe data according to the operation of the controller. The storage 160may additionally include an electrically erasable and programmable ROM(EEPROM) to store various reference data.

The storage 160 may store information about the user voices which areapplicable in the current situation of the display apparatus 100.Accordingly, the controller 150 may determine information about the uservoices that are applicable in the current situation of the displayapparatus 100 and control the display apparatus 100 such that thedetermined user voices are outputted in a list form.

The receiver 170 receives various contents. More specifically, thereceiver 170 may receive contents from a web server that transmitscontent files, using a broadcasting station that sends out broadcastprograms using a broadcast network or the web server that sends outcontent files using the Internet.

In one exemplary embodiment of receiving content from a broadcastingstation, the receiver 170 may be implemented as a form that includes atuner (not illustrated), a decoder (not illustrated), or an equalizer(not illustrated). In another exemplary embodiment where the content isreceived from a source such as a web server, the receiver 170 may beimplemented as a network interface card (not illustrated).

The signal processor 180 may perform signal processing of the content sothat the content received via the receiver 170 or the interface 195 isoutputted via the output 110.

More specifically, the signal processor 180 may convert the imageincluded in the content into a form that can be outputted at the display111, by performing operations such as decoding, scaling, or frame rateconversion. Further, the signal processor 180 may convert the audiosignal included in the content into a form that can be outputted at theaudio output 112 by performing signal processing such as decoding.

The remote control signal receiver 191 receives a control signalinputted from an external remote controller. Based on the control signalinputted to the remote control signal receiver 191, the controller 150may perform various operations. For example, the controller 150 mayperform an operation including power on/off, channel change, volumeadjustment, or the like, in response to a control signal inputted at theremote control signal receiver 191.

The input 193 receives various user commands. The controller 150 mayexecute an operation corresponding to the user command inputted to theinput 193. For example, the controller 150 may perform power on/off,channel change, or volume adjustment, in response to user commandinputted to the input 193.

To that purpose, the input 193 may be implemented as an input panel. Theinput panel may be a touch pad, or a key pad including various functionkeys, numeral keys, special keys, or letter keys, or a touch screen.

The interface 195 performs communication with an external device (notillustrated). The external device (not illustrated) may be implementedas various types of electronic devices which can transmit content to thedisplay apparatus 100.

For example, when the display apparatus 100 is implemented as TV, theexternal device (not illustrated) may be implemented as various forms ofelectronic devices which are connected to the display apparatus 100 andwhich can perform functions such as, for example, a set-top box, anaudio device, or a game player.

For example, the interface 195 may communicate with an external device(not illustrated) according to various wired communication methods suchas HDMI or USB, or wireless communication such as Bluetooth or Zigbee.To that purpose, the interface 195 may include a chip or an input portthat corresponds to each communication method. For example, forcommunication with an external device (not illustrated) based on theHDMI communication method, the interface 195 may include a HDMI port.

FIG. 4 is a block diagram of the first server illustrated in FIG. 1.Referring to FIG. 4, the first server 200 may include a communicator 210and a controller 220.

The communicator 210 may communicate with the display apparatus 100.That is, the communicator 210 receives the user voice from the displayapparatus 100 and transmits text information corresponding to the uservoice to the display apparatus 100. To that purpose, the communicator210 may include various communication modules.

The controller 220 controls the overall operation of the first server200. In particular, when the user voice is received from the displayapparatus 100, the controller 220 generates text informationcorresponding to the user voice and controls the first server 200 suchthat the generated information is transmitted to the display apparatus100.

More specifically, the controller 220 may generate text informationcorresponding to the user voice, using a speech to text (STT) engine.The STT engine is a module that converts a speech signal (i.e. audiosignal) into text. It is possible to convert the speech signal into textusing various known STT algorithms.

For example, the controller 220 determines a voice interval, bydetecting the beginning and ending of a voice uttered by the user in thereceived voice of the user. More specifically, the controller 220calculates the energy of the received voice signal, categorizes theenergy level of the voice signal according to the calculated energy, anddetects the voice interval based on the dynamic programming. Thecontroller 220 may detect a phoneme, i.e., the minimum unit of voice, inthe detected voice interval based on an acoustic model, and convert theuser voice into text by applying a hidden Markov model (HMM) to thegenerated phoneme data.

FIG. 5 is a block diagram of the second server of FIG. 1. Referring toFIG. 5, the second server 300 may include a communicator 310, a storage320, and a controller 330.

The communicator 310 performs communication with the display apparatus100. More specifically, the communicator 310 may receive textinformation from the display apparatus 100 and transmit responseinformation corresponding to the text information to the displayapparatus 100. To that purpose, the communicator 310 may include variouscommunication modules.

Further, the communicator 310 may be connected to a web server (notillustrated) and perform web browsing. That is, the controller 330 maytransmit various keywords to the web server (not illustrated) via thecommunicator 310 to perform web browsing, and receive the resultant websearch corresponding to the keywords.

The storage 320 stores various information to generate responseinformation that corresponds to the text information received from thedisplay apparatus 100.

That is, the storage 320 may store interaction patterns per servicedomains. The service domains may be categorized, according to the topicof the voices uttered by the user, into “broadcast,” “VOD,” “applicationmanagement,” “device control,” “information provision (weather, stocktrading, news, etc.),” etc., although not limited thereto. Accordingly,the service domains may be categorized according to various othertopics.

More specifically, the storage 320 may include a corpus database perservice domains. The corpus database may take the form in which samplesentences and response to the same are stored.

That is, the storage 320 may store a plurality of sample sentences andresponses to those, for each of the service domains. The storage 320 mayalso store, by tagging, the information to interpret the samplesentences and responses to the respective sample sentences.

For example, it is assumed that the sample sentence “When is XXX (i.e.,broadcast program name) on?” is stored in the broadcast service domain.

In the above example, the storage 320 may store, by tagging, theinformation to interpret the sample sentence “When is XXX (i.e.,broadcast program name) on?” That is, regarding the sample sentence“When is XXX (i.e., broadcast program name) on?” the storage 320 maystore, by tagging, the information that indicates that “XXX (i.e.,broadcast program name)” refers to the name of the broadcast program,that “When” refers to inquiry about broadcasting time, and that “?”refers to question. Further, the storage 320 may tag and store theinformation to the corresponding sample sentence, which indicates thatthe phrase “When is ˜ on?” needs a word related with broadcast programto fill the same. The word related with the broadcast program mayinclude the name of the broadcast program, appearing persons, orproduction manager.

Further, the storage 320 may tag a response to the sample sentence “Whenis XXX (i.e., broadcast program name) on?” and store the same. Morespecifically, the storage 320 may tag “The <broadcast program name> youinquired about is broadcasted at <broadcast time>” and store the same asa response to the sample sentence “When is XXX (i.e., broadcast programname) on?”

In another exemplary embodiment, it is assumed that a sample sentence“What is on TV today?” is stored in the broadcast service domain.

In the above example, the storage 320 may tag information to interpretthe sample sentence “What is on TV today?” to the sample sentence andstore the same. More specifically, the storage 320 may tag and store theinformation to the corresponding sample sentence, that the “on TV”refers to a broadcast program, “today,” “on TV” or “What” indicatesinquiry about a broadcast program scheduled to be broadcasted today, and“?” indicates that the sample sentence is a question. Further, thestorage 320 may tag and store the information that words related withthe broadcast time (e.g., now, tomorrow, etc.) may fill the samplesentence with the pattern of “What is on TV ˜?”

In yet another exemplary embodiment, it is assumed that the samplesentence “Change to #X (i.e., channel number)” is stored in the devicecontrol service domain.

In the above example, the storage may tag and store the information tointerpret the sample sentence like “Change to #X (i.e., channel number),please.” More specifically, in the sample sentence “Change to #X (i.e.,channel number), please,” the storage 320 may tag and store theinformation to the corresponding sample sentence, which indicates that“#X (i.e., channel number)” refers to a channel number, that “Change”indicates a command to tune to a channel, and that “please” indicatesthat the sample sentence is a request sentence. Further, the storage 320may tag and store the information in the corresponding sample sentence,which indicates that the word related with a broadcast program may fillthe sample sentence with the pattern of “Change to ˜, please.” The wordsrelated with a broadcast program may include channel number, channelname, broadcast program name, appearing persons, or production manager.

The storage 320 may also tag and store a response to the sentence“Change to #X (i.e., channel number), please.” More specifically, thestorage 320 may tag and store a response “Channel changed to <channelnumber>” to the sentence “Change to #X (i.e., channel number), please.”

In yet another exemplary embodiment, it is assumed that a samplesentence “Execute XX (i.e., application name), please” is stored in thedevice control service domain.

In the above example, the storage 320 may tag and store the informationto interpret the sample sentence “Execute XX (i.e., application name),please” to the sample sentence. More specifically, in the samplesentence “Execute XX (i.e., application name),” the storage 320 may tagand store the information which indicates that “XX (i.e., applicationname)” refers to the name of the application, that “Execute” refers to acommand to execute the application, and that “please” indicates that thesample sentence is a request sentence.

Further, the storage 320 may tag and store the information to the samplesentence, which indicates that the word related with function executableon the display apparatus 100 may fill the sample sentence with thepattern “Execute ˜, please.” The word related with the functionexecutable on the display apparatus 100 may include names ofapplications, and names of functions executable on the display apparatus100 such as a telephone call, for example.

In still another exemplary embodiment, it is assumed that the samplesentence “See more” is stored in the device control service domain.

In the above exemplary embodiment, the storage 320 may tag and store theinformation to interpret the sample sentence “See more” to the samplesentence. That is, the storage 320 may tag and store the information tothe sample sentence such as “See more,” which indicates that the samplesentence includes an utterance intention that requests information aboutthe voice usable on the display apparatus 100.

The storage 320 may also tag and store control commands to control thedisplay apparatus 100 with the respective sample sentences. Inparticular, the storage 320 may tag and store the control commands tocontrol the display apparatus 100 to the sample sentences correspondingto the user voices to control the display apparatus 100.

In the exemplary embodiments explained above, the storage 320 may tagand store the control command to change a channel of the displayapparatus 100 to channel #X, to the sample sentence such as “Change to#X (i.e., channel number), please.” Further, the storage 320 may tag andstore a control command to output voices that are usable on the displayapparatus 100, to the sample sentence such as “See more.”

Meanwhile, the exemplary embodiments are not limited to the storedsample sentences and responses exemplified above. Accordingly, variousother sample sentences and corresponding responses may be stored perservice domain.

The controller 330 controls the overall operation of the second server300. More specifically, when the text information corresponding to theuser voice is received from the display apparatus 100, the controller330 generates response information corresponding to the received textinformation, and controls the communicator 310 such that the generatedresponse information is transmitted to the display apparatus 100 via thecommunicator 310.

The controller 330 may determine the utterance intention included in theuser voice by analyzing the text information, generate responseinformation that corresponds to the determined utterance intention, andcontrol the communicator 310 to transmit the same to the displayapparatus 100.

To that purpose, the controller 330 may detect the corpus database thatincludes interaction patterns matching the received text information,and determine the service domain to which the user voice belongs.

More specifically, the controller 330 may compare the received textinformation with the samples stored per service domains, and determinethat the domain that includes the sample sentence matching the receivedtext information to be the domain to which the user voice belongs.

For example, when receiving text such as “When is XXX (i.e., broadcastprogram name) on?” or “What is on TV today?” from the display apparatus100, the controller 330 may determine that the user voice collected atthe display apparatus 100 belongs to the broadcast service domain. Whenreceiving the text such as “Change to #X (i.e., channel number), please”or “Execute XX (i.e., application name), please” from the displayapparatus 100, the controller 330 may determine that the user voicecollected at the display apparatus 100 belongs to the device controlservice domain.

When no sample sentence is found to be matching the received textinformation, the controller 330 may determine a domain where the uservoice belongs, based on statistics.

By way of example, it is assumed that the display apparatus 100 collects“Can I change to #X (i.e., channel number), please” and that the textcorresponding to the collected user voice is transmitted to the secondserver 300. In the above example, the controller 330 may determine thatthe user voice is statistically similar to “Change to #X (i.e., channelnumber), please,” using categorization models such as the hidden Markovmodel (HMM), condition random fields (CRF) or support vector machine(SVM), and determine that “Can I change to #X (i.e., channel number),please” belongs to the device control service domain.

The storage 320 may also store the text information that is determinedto be statistically similar to the pre-stored sample sentence. In oneexemplary embodiment, the storage 320 may store the text information asanother sample sentence of the service domain where the statisticallysimilar sample sentence belongs. Further, by referring to pre-storedsample sentences, the storage 320 may tag and store the information tointerpret the newly-stored sample sentences, to the correspondingnewly-stored sample sentence.

For example, it is assumed that the sentence “Can I change to #X (i.e.,channel number), please” is stored as a new sample sentence.

In the above example, the storage 320 may tag and store the informationto interpret the sample sentence such as “Can I change to #X (i.e.,channel number), please” to the corresponding sample sentence, byreferring to “Change to #X (i.e., channel number), please.” That is, thestorage 320 may tag and store the information to the correspondingsample sentence “Can I change to #X (i.e., channel number), please,”which indicates that “#X (i.e., channel number)” refers to a number ofchannel, that “change” refers to a command to tune to a channel, andthat “Can I ˜, please” indicates that the sample sentence is a requestsentence. Further, the storage 320 may tag and store the information tothe corresponding sample sentence, which indicates that word relatedwith broadcast program may fill the sample sentence with the pattern“Can I change to ˜, please”. The word related with the broadcast programmay include the channel number, broadcast station name, broadcastprogram name, appearing persons, or production manager.

Further, the storage 320 may tag and store the response to “Can I changeto #X (i.e., channel number), please,” to the corresponding sentence.More specifically, the storage 320 may tag and store “Channel changed to<channel number>” as a response to “Can I change to #X (i.e., channelnumber), please.”

Meanwhile, when there are a plurality of sample sentences matching thetext information received from the display apparatus 100 and when theplurality of sample sentences belong to different service domains, thecontroller 330 may determine the corresponding service domains of theuser voices based on statistical analysis.

That is, the controller 330 may weight the respective words (or,morphemes) constructing the text information received from the displayapparatus 100 based on the frequency the words (or, morphemes) which arepresent in the respective service domains, and determine the servicedomain corresponding to the user voice by considering the appliedweights.

For example, it is assumed that the text “Change to XXX (i.e., broadcastprogram name), please” is received from the display apparatus 100, in astate where the sample sentence “Change to XXX (i.e., broadcast programname), please” is stored in the broadcast service domain and the VODservice domain, respectively.

In the above example, the controller 330 may determine that the samplesentence matching “Change to XXX (i.e., broadcast program name), please”is present in the broadcast service domain and the VOD service domain,and weight words (or morphemes) that construct “Change to XXX (i.e.,broadcast program name), please”, i.e., weight “change” and “please” forthe respective service domains, based on the frequency that “change” and“please” are present in the respective service domains.

For example, the controller 330 may calculate the weight of “change” inthe broadcast service domain based on the rate of the sentences thatinclude “change” therein in the all sample sentences stored in thebroadcast service domain, and calculate the weight of “please” in thebroadcast service domain based on the rate of the sample sentences thatinclude “please” among all the sample sentences stored in the broadcastservice domain.

In the likewise manner explained above, the controller 330 may calculatethe weight of “change” in the VOD service domain based on the rate ofthe sentences that include “change” therein in the all sample sentencesstored in the VOD service domain, and calculate the weight of “please”in the VOD service domain based on the rate of the sample sentences thatinclude “please” among all the sample sentences stored in the VODservice domain.

Thereafter, the controller 330 may determine the service domain that theuser voice belongs to, by calculating the weights applied to therespective words. Accordingly, in the example explained above, thecontroller 330 may compare the product of multiplying the weight appliedto “change” by the weight applied to “please” in the broadcast servicedomain, with the product of multiplying the weight applied to “change”by the weight applied to “please” in the VOD service domain, anddetermine that the user voice belongs to the service domain with largerresult.

That is, when the result of the calculation based on the weights appliedin the broadcast service domain is larger than the result of thecalculation based on the weights applied in the VOD service domain, thecontroller 330 may determine that “Change to XXX (i.e., broadcastprogram name), please” belongs to the broadcast service domain. On thecontrary, when the result of the calculation based on the weightsapplied in the VOD service domain is larger than the result of thecalculation based on the weights applied in the broadcast servicedomain, the controller 330 may determine that “Change to XXX (i.e.,broadcast program name), please” belongs to the VOD service domain.

However, one of ordinary skill in the art will understand that the aboveis only provided for illustrative purposes. Accordingly, the controller330 may statistically determine the service domain to which the uservoice belongs, according to various methods.

Thereafter, the controller 330 extracts a dialogue act, main action, andcomponent slot (or entity name) from the user voice, based on theservice domain to which the user voice belongs.

The ‘dialogue act’ is the category related to the form of sentences,which indicates whether the corresponding sentence is a statement, arequest, or a question.

The ‘main action’ refers to meaningful information that represents anact intended by the corresponding utterance in the dialogue in aspecific domain. For example, in the broadcast service domain, the mainaction may include TV power on/off, broadcast program search, broadcastprogram time search, or broadcast program reserve. In another example,in the device control service domain, the main action may include devicepower on/off, play, or pause.

The ‘component slot’ refers to entity information of a specific domainthat appears in the utterance, i.e., refers to additive information thatspecifies the meaning of an action as intended by a specific domain. Forexample, the component slot in the broadcast service domain may includegenre, broadcast program name, broadcast time, channel name oractor/actress name, and may include in the device control service domainthe name of external device, the manufacturer, or the like.

The controller 330 may determine the intention of utterance included inthe user voice, using the extracted dialogue act, main action, andcomponent slot, and generate response information corresponding to thedetermined utterance intention and transmit this to the displayapparatus 100.

The response information may include system response information tooutput a system response from the display apparatus 100. To thatpurpose, the controller 330 may extract a reply to the determinedutterance intention from the storage 320, and convert the extractedreply into text to thus generate system response information. Further,the controller 330 may search information corresponding to thedetermined utterance intention and generate system response informationusing the searched information.

Further, the response information may include a control command toexecute a function that corresponds to the user voice. The controlcommand may include a control command to control the display apparatus100 to perform the function corresponding to the user voice. To thatpurpose, the controller 330 may generate a control command correspondingto the determined utterance intention and control the communicator 310such that the generated control command is transmitted to the displayapparatus 100.

Hereinbelow, a method of the controller 330 for generating responseinformation corresponding to user voice will be explained in detail byreferring to exemplary embodiments.

First, the controller 330 may extract a dialogue act, main action, andcomponent slot from the user voice, using the information tagged to thesample sentence that is matched with the user voice, or that isstatistically determined to be similar to the user voice, generateresponse information corresponding to the user voice, and transmit thesame to the display apparatus 100.

For example, it is assumed that the text “When is XXX (i.e., broadcastprogram name) on?” is received from the display apparatus 100.

In the above example, the controller 330 may determine that the receivedtext belongs to the broadcast service domain, extract a dialogue act,main action, and component slot from the user voice, using theinformation tagged to the sample sentence “When is XXX (i.e., broadcastprogram name) on?” that is matched with the received text in thebroadcast service domain, and generate corresponding responseinformation.

That is, the sample sentence “When is XXX (i.e., broadcast program name)on?” stored in the broadcast service domain is tagged with theinformation to interpret the sample sentence, including the informationthat indicates that “XXX (i.e., broadcast program name)” refers to abroadcast program, that “when” refers to request for broadcast time, andthat “?” indicates that the sample sentence is a question. Accordingly,the controller 330 may determine that, based on the above information,the dialogue act of the sample sentence “When is XXX (i.e., broadcastprogram name) on?” is a question, the main action is the inquiry forbroadcast time, and the component slot is XXX (i.e., broadcast programname). Accordingly, the controller 330 may determine that the utteranceintention included in the user voice is to “inquire” about the“broadcast time” of “XXX”.

Further, the controller 330 may search the storage 320 for the responsetagged to the sample sentence “When is XXX (i.e., broadcast programname) on?” stored in the broadcast service domain, and generate systemresponse information using the tagged response.

That is, the controller 330 may search for a response corresponding tothe user voice, “The broadcast time of <broadcast program name> youinquired about is <broadcast time>,” which is tagged to the samplesentence “When is XXX (i.e., broadcast program name) on?” stored in thebroadcast service domain.

In the above example, the controller 330 may fill in the blank appearingin the searched response and generate a complete sentence.

For example, in the response “The broadcast time of <broadcast programname> you inquired about is <broadcast time>,” the controller 330 maywrite in “XXX (i.e., broadcast program name)” in the blank <broadcastprogram name>. The controller 330 may also search for the broadcast timeof the “XXX (i.e., broadcast program name)” based on the electronicprogram guide (EPG) of the “XXX (i.e., broadcast program name)” andwrite in another blank, <broadcast time>. As a result, the controller330 may generate a complete sentence, “The broadcast time of the XXX(i.e., broadcast program name) you inquired about is seven o'clock,Saturday” as the system response information and transmit the generatedsystem response information to the display apparatus 100.

Accordingly, the display apparatus 100 may output “The broadcast time ofthe XXX (i.e., broadcast program name) you inquired about is seveno'clock, Saturday” in the form of at least one of a voice and a UIscreen, based on the system response information received from thesecond server 300.

In another exemplary embodiment, it is assumed that the text “What is onTV today?” is received from the display apparatus 100.

In the above example, the controller 330 may determine that the receivedtext belongs to the broadcast service domain, extract a dialogue act,main action and component slot from the user voice, using theinformation tagged to the sample sentence “What is on TV today?” that ismatched with the received text in the broadcast service domain, andgenerate corresponding response information.

That is, the sample sentence “What is on TV today?” stored in thebroadcast service domain is tagged with the information to interpret thesample sentence, including the information that indicates that “today,”“on TV,” and “What” indicate an inquiry about a broadcast programscheduled to be broadcasted today, and that “?” indicates that thesample sentence is a question. Accordingly, the controller 330 maydetermine that, based on the above information, the dialogue act of thesample sentence “What is on TV today?” is a question, the main action isthe inquiry about broadcast program scheduled to be broadcasted today,and the component slot is the broadcast program. Accordingly, thecontroller 330 may determine that the utterance intention included inthe user voice is to “inquire” about the “broadcast programs” to bebroadcasted “today.”

In one exemplary embodiment, the controller 330 may refer to an EPG tosearch today's broadcast programs, and control the communicator 310 totransmit to the display apparatus 100 the information about the searchedbroadcast program which may include at least one of content name, genre,broadcast time, cast, corresponding thumbnail images, channel name wherethe content is broadcasted, or production crew.

The controller 330 may also control the communicator 310 to transmit theinformation about the determined utterance intention to the displayapparatus 100. That is, the controller 330 may transmit a controlcommand to output a list of today's broadcast programs to the displayapparatus 100, when determining that the user's utterance intentionrelates to an inquiry about information about the broadcast programsscheduled to be broadcasted today.

Further, the controller 330 may control the communicator 310 such thatthe information about the searched broadcast program and informationabout the determined utterance intention are transmitted together to thedisplay apparatus 100.

Accordingly, the display apparatus 100 may output the list of today'sbroadcast programs in the form of at least one of a voice and a UIscreen, based on the information received from the second server 300.

In yet another exemplary embodiment, it is assumed that the text “Changeto #X (i.e., channel number), please” is received from the displayapparatus 100.

In the above example, the controller 330 may determine that the receivedtext belongs to the device control service domain, extract a dialogueact, main action, and component slot from the user voice, using theinformation tagged to the sample sentence “Change to #X (i.e., channelnumber), please” that is matched with the received text in the devicecontrol service domain, and generate corresponding response information.

That is, the sample sentence “Change to #X (i.e., channel number),please” stored in the device control service domain is tagged with theinformation to interpret the sample sentence, including the informationthat indicates that “#X (i.e., channel number)” refers to channelnumber, that “Change” refers to a command to tune to a broadcast, andthat “please” indicates that the sample sentence is a request.Accordingly, the controller 330 may determine that, based on the aboveinformation, the dialogue act of the sample sentence “Change to #X(i.e., channel number), please” is a request, the main action is thecommand to tune to a broadcast, and the component slot is #X (i.e.,channel number). Accordingly, the controller 330 may determine that theutterance intention included in the user voice is to “request”“broadcast tuning” to “#X”.

Further, the controller 330 may search the storage 320 for a responsetagged to the sample sentence “Change to #X (i.e., channel number),please” stored in the broadcast service domain, and generate responseinformation using the tagged response.

That is, the controller 330 may search a tagged response, “Channelchanged to <channel number>” stored in the broadcast service domain, asa response corresponding to the user voice, “Change to #X (i.e., channelnumber), please.”

In the above example, the controller 330 may fill in a blank appearingin the searched response to generate a complete sentence.

For example, the controller 330 may write “#X (i.e., channel number)” inthe blank <channel number> appearing in the response like “Channelchanged to <channel number>.” Accordingly, the controller 330 maygenerate a complete sentence form of “Channel changed to #X” as thesystem response information corresponding to the user voice, andtransmit the generated system response information to the displayapparatus 100.

Further, the controller may search the storage 320 for a control commandtagged to the sample sentence “Change to #X (i.e., channel number),please” stored in the device control service domain, and control thecommunicator 310 such that the tagged control command is transmitted tothe display apparatus 100. That is, the controller 330 may transmit atagged control command to change a channel of the display apparatus 100to #X, corresponding to the sample sentence “Change to #X (i.e., channelnumber), please” stored in the device control service domain, to thedisplay apparatus 100.

As a result, the display apparatus 100 may change a channel to #X basedon the control command received from the second server 300, and output“Channel changed to #X” in the form of at least one of a voice and a UIscreen, based on the system response information received from thesecond server 300.

Meanwhile, the controller 330 may generate a control command to executea function of the display apparatus 100 based on the control commandtagged to sample sentence, but is not limited thereto.

That is, the controller 330 may generate a control command based on adetermined utterance intention, and transmit the same to the displayapparatus 100. For example, when determining that the user's utteranceintention is to request broadcast tuning to #X, the controller 330 maygenerate a control command to change a channel to #X and transmit thecontrol command to the display apparatus 100.

Further, the controller 330 may transmit the control command to executea function of the display apparatus 100, along with the system responseinformation to output system response, but is not limited thereto.Accordingly, the controller 330 may transmit only the control command toexecute a function of the display apparatus 100 to the display apparatus100, depending on the user's utterance intention.

In the above exemplary embodiment, the display apparatus 100 may notoutput a separate system response, or output a system response usingpre-stored data in the display apparatus 100.

In another exemplary embodiment, it is assumed that the text “Execute XX(i.e., application name), please” is received from the display apparatus100.

In the above example, the controller 330 may determine that the receivedtext belongs to the device control service domain, extract a dialogueact, main action, and component slot from the user voice, using theinformation tagged to the sample sentence “Execute XX (i.e., applicationname), please” that is matched with the received text in the devicecontrol service domain, and generate corresponding response information.

That is, the sample sentence “Execute XX (i.e., application name),please” stored in the device control service domain is tagged with theinformation to interpret the sample sentence, including the informationthat indicates that “XX (i.e., application name)” refers to a name of anapplication, that “Execute” refers to a command to execute anapplication, and that “please” indicates that the sample sentence is arequest. Accordingly, the controller 330 may determine that, based onthe above information, the dialogue act of the sample sentence “ExecuteXX (i.e., application name), please” is a request, the main action isthe command to execute an application, and the component slot is XX(i.e., application name). Accordingly, the controller 330 may determinethat the utterance intention included in the user voice is to “request”“application execution” regarding “XX (i.e., application name).”

In one exemplary embodiment, the controller may search the storage 320for a control command tagged to the sample sentence “Execute XX (i.e.,application name), please” stored in the device control service domain,and control the communicator 310 such that the tagged control command istransmitted to the display apparatus 100. That is, the controller 330may transmit a tagged control command to execute XX (i.e., applicationname) on the display apparatus 100, corresponding to the sample sentence“Execute XX (i.e., application name), please” stored in the devicecontrol service domain, to the display apparatus 100.

As a result, the display apparatus 100 may execute XX (i.e., applicationname), based on the control command received from the second server 300.

Meanwhile, the controller 330 may generate a control command to executea function of the display apparatus 100 based on the control commandtagged to sample sentence, but is not limited thereto.

That is, the controller 330 may generate a control command based on thedetermined utterance intention and transmit the same to the displayapparatus 100. For example, when determining that the user's utteranceintention is to request execution of XX (i.e., application name), thecontroller 330 may generate a control command to execute XX (i.e.,application name) and transmit the same to the display apparatus 100.

Meanwhile, the controller 330 may extract s dialogue act, main action,and component slot from the user voice by using the information taggedto the sample sentence that is statistically determined to be similar tothe user voice, and generate corresponding response information.

For example, it is assumed that the text “When is YYY (i.e., broadcastprogram name) on?” is received from the display apparatus 100.

In the above example, the controller 330 may determine that the text“When is YYY (i.e., broadcast program name) on?” is statisticallysimilar to the sample sentence “When is XXX (i.e., broadcast programname) on?” stored in the broadcast service domain. Accordingly, thecontroller 330 may extract the dialogue act, main action, and componentslot from the user voice by using the information tagged to the samplesentence “When is XXX (i.e., broadcast program name) on?” stored in thebroadcast service domain, and generate corresponding responseinformation.

That is, the sample sentence “When is XXX (i.e., broadcast program name)on?” stored in the broadcast service domain is tagged with informationto interpret the same, which indicates that the word related withbroadcast program may be filled in the sentence pattern “When is ˜ on?”Accordingly, the controller 330 may search for the meaning of “YYY(i.e., broadcast program name)” among the words related with thebroadcast program such as broadcast program name, appearing person, orproduction manager.

To that purpose, the storage 320 may include an entity name dictionaryor TIMEX dictionary which stores information about the components foreach of the service domains.

That is, referring to entity name dictionary or TIMEX dictionary, thecontroller 330 may search for the meaning of “YYY (i.e., broadcastprogram name)” and determine that “YYY (i.e., broadcast program name)”refers to a broadcast program.

However, other exemplary embodiments are also possible. For example, thecontroller 330 may search for the meaning of “YYY (i.e., broadcastprogram name)” by using an EPG, or the like, or search for the meaningof “YYY (i.e., broadcast program name)” by using pre-stored samplesentences and a conditional random field (CRF).

Accordingly, the controller 330 may determine that the dialogue act ofthe text “When is YYY (i.e., broadcast program name) on?” received fromthe display apparatus 100 is a question, the main action is to inquirebroadcast time, and the component slot is YYY (i.e., broadcast programname). The controller 330 may also determine that the utteranceintention included in the user voice is to “inquire” about “broadcasttime” of “YYY.”

Further, the controller 330 may generate system response informationabout “When is YYY (i.e., broadcast program name) on?”, by using theresponse tagged to the sample sentence “When is XXX (i.e., broadcastprogram name) on?” stored in the broadcast service domain, and transmitthe generated system response information to the display apparatus 100.

That is, the controller 330 may search for a response corresponding tothe user voice, “The broadcast time of <broadcast program name> youinquired about is <broadcast time>”, which is the tagged response to thesample sentence “When is XXX (i.e., broadcast program name) on?” storedin the broadcast service domain. The controller 330 may also generatesystem response information corresponding to the user voice based on thecomplete sentence, “The broadcast time of YYY (i.e., broadcast programname) is 11 o'clock, Wednesday”, and transmit the generated systemresponse information to the display apparatus 100.

Meanwhile, when some of the sentence data of the system responseoutputted from the display apparatus 100 is stored, the controller 330may control the communicator 310 such that only certain text may betransmitted to the display apparatus 100 to complete the correspondingsentence.

For example, when the response such as “The broadcast time of <broadcastprogram name> you inquired about is <broadcast time>” is pre-stored inthe display apparatus 100, the controller 330 may control thecommunicator 310 such that the broadcast program name inquired about bythe user and the broadcast time thereof are transmitted in the text formto the display apparatus 100 to complete the response stored in thedisplay apparatus 100. In one exemplary embodiment, the controller 330may transmit a separate control signal to output a pre-stored responseto the display apparatus 100, to the display apparatus 100.

Accordingly, the display apparatus 100 may write the text received fromthe second server 300 in the pre-stored response, to thus output thesystem response, which may be, “The broadcast time of XXX (i.e.,broadcast program name) is 7 o'clock, Saturday.”

Meanwhile, the controller 330 may extract the dialogue act, main action,and component slot using the information tagged to the sample sentence,but is not limited thereto. That is, the controller 330 may extract thedialogue act and main action from the user voice by using a maximumentropy classifier (MaxEnt), and extract the component slot by using theconditional random field (CRF).

Of course, the exemplary embodiments are not limited to any specificexamples only. Accordingly, the controller 330 may extract the dialogueact, the main action, and the component slot from the user voice in manyknown manners.

Meanwhile, when the controller 330 is not able to determine theutterance intention of the user included in the currently-received uservoice, the controller 330 may refer to the previously-received uservoices to determine the user's utterance intention included in thecurrently-received user voice.

More specifically, when the controller 330 is not able to determine theuser's utterance intention included in the currently-received user voicebased on the service domain to which the previously-received user voicebelongs, the controller 330 may determine the user's utterance intentionincluded in the currently-received user voice, by referring to thepreviously-received user voices.

For example, the controller 330 may determine whether or not thecurrently-received user voice corresponds to the initial user utterancein the dialogue pattern, by comparing the currently-received user voicewith the dialogue pattern stored in the corpus database, and in responseto determining that it does not correspond to the initial userutterance, may determine the user's utterance intention included in thecurrently-received user voice by referring to the previously-receiveduser voices.

To that purpose, the storage 320 may store the respective samplesentences of the respective service domains in a hierarchical manner.That is, the storage 320 may store the top hierarchy level based on thesample sentence that matches the user voice that can be initiallyuttered by the user in each service domain, and store the samplesentences corresponding to the user voices that can follow thereafter ina hierarchical manner.

For example, the storage 320 may store “What is on TV today?” as the tophierarchy layer in the broadcast service domain, and store the lowerlayer sample sentences such as “The third one,” “Can I see the detailsof the third one, please,” “What is on XXX (i.e., channel name)?”,“Which is the documentary program?”, “Can I see the program thatfeatures XXX (i.e., appearing persons' name)?”, or “Can I see XXX (i.e.,broadcast program name), please?”

In another exemplary embodiment, the storage 320 may store “Execute XXX(i.e., application name), please” as the top hierarchy layer in thedevice control service domain. When the “XXX (i.e., application name)”is a web browsing application, the storage 320 may store the lower layersample sentences of “Execute XXX (i.e., application name), please,”based on the sample sentences executable on the web browsing applicationsuch as, “home page,” “favorites,” “refresh,” “open new page,” “closecurrent page,” “backward,” “forward,” or “end.”

A method for determining the user's utterance intention included in thecurrently-received user voice with reference to the previously-receiveduser voices, will be explained in greater detail below.

First, it is assumed that the user voice “What is on TV today?” isinputted, and that the user voice “What is on XXX (i.e., channel name)today?” is inputted thereafter. In the above example, when thecontroller 330 determines that the user voice “What is on XXX (i.e.,channel name) today?” does not corresponds to the initial user utterancein the broadcast service domain, the controller 330 may determine theutterance intention of “What is on XXX (i.e., channel name)?” based onthe previously-received user voice “What is on TV today?”

Accordingly, the controller 330 may determine that, using thepreviously-received user voice “What is on TV today?”, the utteranceintention of “What is on XXX (i.e., channel name)?” is to “inquire”about “broadcast program” scheduled to be broadcasted on “XXX (i.e.,channel name)” among the programs broadcasted today.

Accordingly the controller 330 may generate response informationcorresponding to the determined utterance intention and transmit thesame to the display apparatus 100. That is, according to the determinedutterance intention, the controller 330 may search for the broadcastprogram broadcasted on XXX (i.e., channel name) among the today'sbroadcast programs, and transmit the information about the searchedbroadcast program to the display apparatus 100.

Further, it is assumed that the user voice “What is on TV today?” isinputted, and the user voice “The third one” is inputted thereafter. Inthe above example, when the controller 330 determines that the uservoice “The third one” does not correspond to the initial user utterancein the broadcast service domain, the controller 330 may determine theutterance intention of “The third one” based on the previously-receiveduser voice “What is on TV today?”

More specifically, the controller 330 may determine that the utteranceintention is to “request” “tuning” to “broadcast program” which is the“third one” on the list of the broadcast programs outputted from thedisplay apparatus 100 in response to the previously-received user voice“What is on TV today?” Accordingly, the controller 330 may generateresponse information corresponding to the determined utterance intentionand transmit the same to the display apparatus 100. That is, thecontroller 330 may transmit a control command, to tune to a broadcastprogram that is the third one on the list of broadcast programsoutputted from the display apparatus 100, to the display apparatus 100,according to the determined utterance intention.

Meanwhile, although it is assumed that the user voice “What is on TVtoday?” is inputted and followed by “What is on XXX (i.e., channelname)” and “The third one,” the exemplary embodiments are not limited tothe above specific examples only. That is, various other hierarchicalsample sentences may also be interpreted based on thepreviously-received user voices.

In another exemplary embodiment, it is assumed that the user voice“Execute XXX (i.e., web browsing application name), please” is inputted,and then followed by the user voice “refresh.” In the above example,when the controller 330 determines that the user voice “refresh” doesnot correspond to the initial user utterance in the device controlservice domain, the controller 330 may determine the utterance intentionof “refresh” based on the previously-received user voice “Execute XXX(i.e., web browsing application name), please.”

That is, the controller 330 may determine that the utterance intentionis to “request” to “refresh” the “web browsing screen,” by using thepreviously-received user voice “Execute XXX (i.e., web browsingapplication name), please.”

As a result, the controller 330 may generate response informationcorresponding to the determined utterance intention and transmit thesame to the display apparatus 100. That is, in the exemplary embodimentsexplained above, the controller 330 may transmit a control command, torefresh the web browsing screen, to the display apparatus 100.

Meanwhile, although it is assumed that the user voice “Execute XXX(i.e., web browsing application name), please” is inputted, and thenfollowed by the user voice “refresh,” the exemplary embodiments are notlimited to specific example only. Accordingly, various otherhierarchical sample sentences may be interpreted based on thepreviously-received user voices.

Further, although not limited thereto, the controller 330 may determinewhether or not the currently-received user voice corresponds to theinitial user utterance in the dialogue pattern, and when determiningthat it does not correspond to the initial user utterance, thecontroller 330 may determine the user's utterance intention included inthe currently-received user voice by referring to thepreviously-received user voice.

Accordingly, the controller 330 may use the previously-received uservoice when it is not possible to determine the utterance intentionincluded in the currently-received user voice, by referring to theservice domain that includes the previously-received user voice.

For example, it is assumed that the user voice “What is on TV today?” isinputted and then is followed by the user voice “What is on XXX (i.e.,channel name)?” Since it is not possible to determine the utteranceintention included in the user voice “What is on XXX (i.e., channelname)?” in the broadcast service domain, the controller 330 maydetermine the utterance intention of “What is on XXX (i.e., channelname)?” based on the previously-received user voice “What is on TVtoday?”

That is, the controller 330 may determine that the utterance intentionof “What is on XXX (i.e., channel name)?” is to “inquire” about“broadcast program” broadcasted on “XXX (i.e., channel name)” among theprograms broadcasted today, by using the previously-received user voice“What is on TV today?”

In another exemplary embodiment, it is assumed that the user voice“Execute XXX (i.e., web browsing application name), please” is inputtedand then followed by the user voice “refresh.” In the above example,since it is not possible to determine the utterance intention includedin the user voice “refresh” in the device control service domain, thecontroller 330 may determine the utterance intention of “refresh” basedon the previously-received user voice “Execute XXX (i.e., web browsingapplication name), please.”

That is, by using the previously-received user voice “Execute XXX (i.e.,web browsing application name), please,” the controller 330 maydetermine that the utterance intention of “refresh” is to “request” to“refresh” the “web browsing screen.”

Meanwhile, the storage 320 may store the information about the uservoice that is outputted from the display apparatus 100 as the voicecommand guide. That is, the storage 320 may store the information aboutthe user voice usable in each situation of the display apparatus 100.

As a result, the controller 330 may determine the information about uservoice applicable in the current situation of the display apparatus 100by using the pre-stored information, and transmit the determinedinformation about the user voice to the display apparatus 100.

For example, when the user voice “What is on TV today?” is received, thecontroller 330 may determine the utterance intention of “What is on TVtoday?” is to “inquire” about “broadcast program” broadcasted “today,”and transmit the information about today's broadcast program to thedisplay apparatus 100. The controller 330 may transmit information aboutuser voices applicable for the broadcast program list outputted as thesystem response, such as, for example, “The third one,” “Can I see thedetails of the third one, please,” “What is on XXX (i.e., channelname)?”, “Which is the documentary program?”, “Can I see the programthat features XXX (i.e., appearing persons' name)?”, or “Can I see XXX(i.e., broadcast program name), please?”

In another exemplary embodiment, when the user voice “Execute XX (i.e.,name of application related to web browsing), please” is received, thecontroller 330 may determine the utterance intention of “Execute XX(i.e., name of application related to web browsing), please” is to“request” to “execute” the “XX (i.e., name of application related to webbrowsing),” and transmit a control command to execute the XX (i.e., nameof application related to web browsing) to the display apparatus 100.The controller 330 may transmit information about user voices applicablefor the application executed on the display apparatus 100, such as, forexample, “home page,” “favorites,” “refresh,” “open new page,” “closecurrent page,” “backward,” “forward,” or “end.”

Various exemplary embodiments will be explained below with reference toFIGS. 6 to 11.

Referring to FIG. 6, the controller 150 displays initial screen 410.Referring to FIGS. 7A to 7C, the controller 150 may display voicecommand guide 420 on a predetermined area of the initial screen 410.

The voice command guide 420 may display text in a slide show formrepresenting user voices that can execute the available functions of thedisplay apparatus 100, such as, for example, “What is on TV today?”,“Anything fun?”, “Any new movies?”, “Give me recommendations,” “Can Iwatch EBB (i.e., channel name),” and “Execute web browser, please.”

Referring to FIGS. 8A to 8C, when the user utters “What is on TVtoday?”, the controller 150 may output a list 430 of broadcast programsto be broadcasted today as the system response, based on the responseinformation received from the second server 300. The controller 150 maythen output voice command guide 440 representing the user voiceapplicable on the list 430 outputted as the system response.

Referring to FIGS. 8A to 8C, the voice command guide 440 may displaytext in the slide show form representing the user voice that isapplicable to the list 430 of broadcast programs outputted as the systemresponse, such as, for example, “The third one, please” “Can I seedetails of the third one?”, “What is on SBC (i.e., channel name)?”. “CanI see documentary programs?”, “Can I see the program that features Peter(i.e., appearing persons' name)?”, or “Can I see “The Show” (i.e.,broadcast program name), please?”

Meanwhile, the controller 150 may output information about the collecteduser voice. For example, when the text information corresponding to theuser voice is received from the first server 200, the controller 150 maygenerate a UI that includes the received text information and output thesame on the screen. That is, referring to FIGS. 8A to 8C, the controller150 may output “What is on TV today?” 450.

Accordingly, when the user utters “What is on SBC (i.e., channel name)?”the controller 150 may output, as the system response, a list ofbroadcast programs provided by SBC (i.e., channel name) among thetoday's broadcast programs, based on the response information receivedfrom the second server 300.

Meanwhile, the controller 150 may output guide to ensure correct uservoices. To that purpose, the second server 300 may transmit the guideinformation to ensure correct user voices to the display apparatus 100.

That is, when there is no sample sentence matching the received textinformation, the controller 330 may determine the statistically similarsample sentence, as explained above. Accordingly, the controller 330 maytransmit the response information to the display apparatus 100, togetherwith information about the sample sentence that is statistically similarto the received user voice.

For example, referring to FIG. 9, when the user utters “What is on TV?”,the controller 330 determines that “What is on TV?” is statisticallysimilar to “What is on TV today?” among the pre-stored sample sentences,and express “What is on TV today?” and transmit the same to the displayapparatus 100. Accordingly, the controller 150 may generate a UI such as“Try saying “What is on TV today” next time” and output this as theguide 460 to ensure more accurate user voice.

In one exemplary embodiment, the controller 150 may output informationabout collected user voice. Referring to FIG. 9, the controller 150 mayoutput “What is on TV?” 470.

Meanwhile, the form of the voice command guide is not limited to thespecific example provided above, and accordingly, various forms of thevoice command guide can be provided.

For example, referring to FIGS. 10A and 10B, when the user voice “Whatis on TV today?” is collected in a situation that the initial screen 510is outputted, the controller 150 may output a system response based on alist 520 of programs to be broadcasted today, and output voice commandguide 530 that includes information about the user voice usable withrespect to the list 520. More specifically, referring to FIG. 10B, thecontroller 150 may output voice command guide 530 such as “Say “Thethird one, second one, . . . ”” and so on.

Meanwhile, FIG. 11 illustrates an example where the user utters “Seemore” in a state where the initial screen is displayed. In this example,the second server 300 may transmit a control command to outputinformation about usable user voices on the display apparatus 100, tothe display apparatus 100.

Accordingly, based on the received control command, the controller 150may generate UI 610 that includes information about the usable uservoices (see FIG. 11), and output the same. Meanwhile, the informationabout usable voice at the display apparatus 100 may be pre-stored in thedisplay apparatus 100 or received from the second server 300.

FIG. 12 is a flowchart provided to explain a control method of a displayapparatus according to an exemplary embodiment.

At S1210, the user voice is collected.

At S1220, the user voice is transmitted to the first server 200, andtext information corresponding to the user voice is received from thefirst server 200. At S1230, the received text information is transmittedto the second server 300.

At S1240, when the response information corresponding to the textinformation is received from the second server 300, based on theresponse information, the system response corresponding to the uservoice is outputted. The voice command guide applicable to the currentsituation of the display apparatus may be outputted.

That is, when the system response corresponding to the user voice isoutputted, the voice command guide related with the system response maybe outputted.

In the above example, when a list of contents searched in response tothe user voice to search contents is outputted, a voice command guide tofilter the content included in the content list may be outputted.

Further, when the list of contents searched for in response to the uservoice to search contents is outputted, at least one of a voice commandguide to execute specific content included in the content list and avoice command guide to output details of specific content may beoutputted.

Further, when an application executing screen is outputted in responseto user voice to execute an application, a voice command guideapplicable on the application executing screen may be outputted.

The above will be referenced to the explanation provided above withreference to FIGS. 1 to 11, and will not be redundantly explained belowfor the sake of brevity.

Further, a non-transitory computer readable medium recording thereinprogram to sequentially perform a control method according to exemplaryembodiments may be provided. The ‘non-transitory computer readablerecording medium’ refers to a medium which stores data semi-permanentlyand can be read by devices. Specifically, the above various applicationsor programs may be stored and provided in non-transitory computerreadable recording medium such as CD, DVD, hard disk, Blu-ray disk, USB,memory card, or ROM.

Further, although the block diagrams of the display apparatus andservers omit a bus, communication among the respective components of thedisplay apparatus and servers may be performed via the bus. Further,each device may additionally include at least one of a processor such asCPU, a microprocessor, a hardware module, or a circuit to performvarious steps explained above.

The foregoing exemplary embodiments and advantages are merely exemplaryand are not to be construed as limiting the exemplary embodiments. Thepresent teaching can be readily applied to other types of apparatuses.Also, the description of the exemplary embodiments is intended to beillustrative, and not to limit the scope of the inventive concept, asdefined by the appended claims, and many alternatives, modifications,and variations will be apparent to those skilled in the art.

What is claimed is:
 1. An electronic apparatus, comprising: an output; avoice collector configured to collect a user voice; and a controllerconfigured to control the output to output a system responsecorresponding to the user voice, wherein the controller is furtherconfigured to control the output such that a voice command guideapplicable to a current situation of the electronic apparatus isoutputted.
 2. The electronic apparatus of claim 1, wherein, in responseto the system response corresponding to the user voice being outputted,the controller is further configured to control the output to output thevoice command guide including a guide which is related with the systemresponse.
 3. The electronic apparatus of claim 1, wherein, in responseto a list of contents, resulting from a search performed in accordancewith the user voice including a command to search for contents, beingoutputted, the controller is further configured to control the output tooutput the voice command guide including a guide to filter the contentincluded in the list of contents.
 4. The electronic apparatus of claim1, wherein, in response to a list of contents, resulting from a searchperformed in accordance with the user voice including a command tosearch for contents, being outputted, the controller is furtherconfigured to control the output to output the voice command guideincluding at least one of a guide to execute specific content includedin the list of contents, and a guide to output details of the specificcontent.
 5. The electronic apparatus of claim 1, wherein, in response toan application executing screen, corresponding to the user voiceincluding a command to execute the corresponding application, beingoutputted, the controller is further configured to control the output tooutput the voice command guide including a guide applicable to theapplication executing screen, and wherein the outputting the applicationexecuting screen comprises displaying the application executing screenon a screen included in the electronic apparatus.
 6. The electronicapparatus of claim 1, wherein the electronic apparatus is a displayapparatus.
 7. The electronic apparatus of claim 1, further comprising: afirst communicator configured to transmit the user voice to a firstserver and receive from the first server text information correspondingto the user voice; and a second communicator configured to transmit thereceived text information to a second server, wherein in response toresponse information corresponding to the text information beingreceived from the second server, the controller is further configured tocontrol the output such that the system response is outputted based onthe response information.
 8. A control method of an electronicapparatus, comprising: collecting a user voice; transmitting the uservoice to a first server and receiving from the first server textinformation corresponding to the user voice; transmitting the receivedtext information to a second server; and in response to responseinformation corresponding to the text information being received fromthe second server, outputting a system response corresponding to theuser voice based on the response information and outputting a voicecommand guide applicable to a current situation of the electronicapparatus.
 9. The control method of claim 8, wherein, in response to alist of contents, resulting from a search performed in accordance withthe user voice including a command to search for contents, beingoutputted, outputting the voice command guide including a guide tofilter the content included in the list of contents.
 10. The controlmethod of claim 8, wherein, in response to a list of contents, resultingfrom a search performed in accordance with the user voice including acommand to search for contents, being outputted, outputting the voicecommand guide including at least one of a guide to execute specificcontent included in the content list and a guide to output details ofthe specific content.
 11. The control method of claim 8, wherein, inresponse to an application executing screen, corresponding to the uservoice including a command to execute the corresponding application,being outputted, outputting the voice command guide including a guideapplicable to the application executing screen.
 12. The control methodof claim 8, wherein the electronic apparatus is a display apparatus.